Azure Image OCR #228

dkotter · 2020-10-06T14:31:47Z

Description of the Change

Adds a new setting that is turned off by default but can be turned on to run OCR scanning on images. When an image is initially uploaded, we run a few checks on that image. First, we check if the image matches one of the supported types (JPEG, PNG, GIF, BMP). If so, we then check if we have a previous image scan and if so, we check if we have either the handwriting or text tags set, with a high confidence level (above 90%). If all those checks pass, we then run OCR scanning.

If we get a successful response back, we then parse the text out of that response, save that text to wp_content for that image and then save the full response to post meta.

We also add scan/recan functionality to the media modal and single media edit screens. If a scan is run from here, we bypass our checks, as we assume if someone is manually starting a scan, they don't care about the checks. Also added some better error handling to the scan/rescan buttons, so if a scan fails, we don't continue to show the loading icon but instead remove that, keep the button disabled and change the button text to say error (ideally we would add even better error handling in a separate PR). This is now in a separate PR: #231

TODO: last piece is to add describedby text into the content when needed. Need to support the block editor and the classic editor

Alternate Designs

Benefits

Images with text can now have that text automatically read and then inserted into the content, with the proper describedby tags. This helps provide more context for images, especially images that are screenshots of text (like social media posts).

Possible Drawbacks

With this feature turned on, will cause a slight slowdown (seems like roughly 5 extra seconds), in image processing.

Verification Process

Checklist:

I have read the CONTRIBUTING document.
My code follows the code style of this project.
My change requires a change to the documentation.
I have updated the documentation accordingly.
I have added tests to cover my change.
All new and existing tests passed.

Applicable Issues

#111

…erate attatchment metadata hook, that will kick off OCR processing. Add a new REST endpoint that will also utilize this same function. Add an OCR class that does all the processing. Currently only process PNG images.

…odal and single media edit screens. Bypass mime check if recan button or option is used. Add filter around approved media types

…ensions. Add support for the four file types azure supports. Some code cleanup

…out supported image types besides PNG for now

… button, to populate the image description after success.

…id errors. Remove local only code

…ng. Pass the previously run image scan into our OCR function and utilize that to determine if the image needs OCR run or not.

jeffpaul · 2020-10-15T16:38:58Z

Noting here that @dinhtungdu is going to work on scaffolding in the Gutenberg bits

…f should only turn off the automatic scanning, not the manual scanning

…te PR

…encodings properly, so we don't end up with weird characters. Keep track of if we need to modify the content or not, so if we don't, we can just return the original and not risk messing anything up. Minor code formatting

…at WordPress itself does in other places. This will hopefully be more lenient across environments and encoding types

src/js/editor-ocr.js

helen · 2020-10-24T02:31:29Z

This is feeling pretty good to me for a first run. I think what we might want to do is add a field to wp_prepare_attachment_for_js() that's a bool for whether classifai_computer_vision_ocr is non-empty (like classifaiHasOcr or something) and only show the prompt in that case, not just if the description field is populated because that description field could be populated manually or from unrelated EXIF data.

src/js/editor-ocr.js

…nto feature/azure-image-ocr

dinhtungdu · 2020-10-28T04:59:55Z

@helen I fixed the REST API response issue. This PR is working on my live site.

…art cropping button. Make sure we have HTML elements before trying to add event handlers to them

…y the modal view

This way any additional line breaks will keep the single block with its ID intact. Also massages the message in the modal.

Didn't realize verse was a pre without line wrapping :(

CSS isn't yet building into `dist`

This is in-progress, see code comments for needs

… the classname we want to add. Minor linting fixes

This allows for a number of customizations, such as being able to store the text results in a different field in case you use post_content extensively in the editorial workflow already, or set other meta, or update the alt text based on the text results, and so forth. Also adds the full $scan data as context for the `classfai_ocr_text` fitler.

…ock is selected

…nto feature/azure-image-ocr

… only used for editor highlighting

fix: switch to use internal style

helen

Nice work here, everybody!

Darin Kotter added 8 commits October 1, 2020 13:04

Add in setting to turn OCR on/off. Add recan functionality to media m…

a73f017

…odal and single media edit screens. Bypass mime check if recan button or option is used. Add filter around approved media types

Add helper function to find image that matches both file size and dim…

a53b66d

…ensions. Add support for the four file types azure supports. Some code cleanup

Don't show OCR rescan options if that setting is turned off. Comment …

5ffc932

…out supported image types besides PNG for now

Better error handling for failed requests. Add callback to OCR rescan…

84aac91

… button, to populate the image description after success.

Make sure we return proper values from our generate functions, to avo…

93c8693

…id errors. Remove local only code

Move OCR scanning into the same function that does other image scanni…

cca2e0b

…ng. Pass the previously run image scan into our OCR function and utilize that to determine if the image needs OCR run or not.

Don't store full response unless we have actual text to save

10c31a8

jeffpaul assigned dkotter Oct 6, 2020

jeffpaul requested a review from helen October 6, 2020 15:54

jeffpaul added this to the 1.6.0 milestone Oct 6, 2020

jeffpaul added the type:enhancement New feature or request. label Oct 6, 2020

Update docblocks

ccf9760

jeffpaul assigned dinhtungdu Oct 15, 2020

Darin Kotter and others added 6 commits October 15, 2020 11:26

Don't remove the OCR scan button if OCR is turned off. Turning OCR of…

02425b5

…f should only turn off the automatic scanning, not the manual scanning

Revert the extra error handling added here, as that's now in a separa…

e2bff1e

…te PR

feat: auto insert description generated by ocr after image block

4f2b000

fix: only enqueue editor script if ocr is enabled

3894f2f

Merge branch 'develop' into feature/azure-image-ocr

c3ac120

Update OCR endpoint to the newly release v3.1

fe46509

This was referenced Oct 21, 2020

Release version 1.6.0 #240

Closed

Update JavaScript build process to work better with Gutenberg #209

Closed

dinhtungdu and others added 4 commits October 23, 2020 15:13

feat: OCR modal and OCR sidebar button

b6df398

Fix tests

1d7aa99

Switch to a different approach to handle encoding issues, to match wh…

ab9705b

…at WordPress itself does in other places. This will hopefully be more lenient across environments and encoding types

helen reviewed Oct 24, 2020

View reviewed changes

src/js/editor-ocr.js Outdated Show resolved Hide resolved

helen reviewed Oct 24, 2020

View reviewed changes

src/js/editor-ocr.js Outdated Show resolved Hide resolved

dinhtungdu added 2 commits October 28, 2020 11:46

fix: ocr status in media api response

9915bed

Merge branch 'feature/azure-image-ocr' of github.com:10up/classifai i…

88fac65

…nto feature/azure-image-ocr

helen and others added 22 commits October 28, 2020 19:58

Merge branch 'develop' into feature/azure-image-ocr

0f536b2

Make sure the OCR manual scan button is added independently of the sm…

3b13b2b

…art cropping button. Make sure we have HTML elements before trying to add event handlers to them

Make sure we don't show the scan buttons on the single edit view, onl…

7e5176a

…y the modal view

Insert a verse block instead of a paragraph.

1a85611

This way any additional line breaks will keep the single block with its ID intact. Also massages the message in the modal.

Go back to paragraph block for now.

0e50784

Didn't realize verse was a pre without line wrapping :(

fix: use group block for scanned text block

dd0e8bb

ci: use composer v1

b44ca65

fix: only allow one ocr block per image

c07a1ed

Create block style for group for editor styling purposes

d69dd98

CSS isn't yet building into `dist`

Commas, sigh.

be0ab73

Better CSS for highlight border

7ee5b72

Highlight related image/OCR block when editor is focused on the other.

94abde8

This is in-progress, see code comments for needs

Add the classnames utility in order to merge existing classnames with…

9afc529

… the classname we want to add. Minor linting fixes

Remove the ocr-related-block class when elements aren't selected anymore

c2778dc

Make sure we don't constantly set and remove the class if an image bl…

be312f7

…ock is selected

Merge branch 'feature/azure-image-ocr' of github.com:10up/classifai i…

ccfa4ea

…nto feature/azure-image-ocr

Filter post content before it's saved to remove our OCR class that is…

2ac56fe

… only used for editor highlighting

fix: switch to use internal style

db0a272

fix: more sensible timeout

df91eb2

fix: deal with different backgrounds

af6b4b0

Merge pull request #257 from 10up/try/ocr-internal-style

e477cd2

fix: switch to use internal style

helen approved these changes Nov 2, 2020

View reviewed changes

jeffpaul linked an issue Nov 2, 2020 that may be closed by this pull request

Integrate Azure Computer Vision for OCR text generation for uploaded files #111

Closed

dinhtungdu merged commit 1c26abd into develop Nov 2, 2020

dinhtungdu deleted the feature/azure-image-ocr branch November 2, 2020 16:31

dkotter mentioned this pull request Jan 7, 2021

Visually highlight insecure assets in the editor 10up/insecure-content-warning#9

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Azure Image OCR #228

Azure Image OCR #228

dkotter commented Oct 6, 2020 •

edited

Loading

jeffpaul commented Oct 15, 2020

helen commented Oct 24, 2020

dinhtungdu commented Oct 28, 2020

helen left a comment

Azure Image OCR #228

Azure Image OCR #228

Conversation

dkotter commented Oct 6, 2020 • edited Loading

Description of the Change

Alternate Designs

Benefits

Possible Drawbacks

Verification Process

Checklist:

Applicable Issues

jeffpaul commented Oct 15, 2020

helen commented Oct 24, 2020

dinhtungdu commented Oct 28, 2020

helen left a comment

Choose a reason for hiding this comment

dkotter commented Oct 6, 2020 •

edited

Loading